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Detailed Description Text - DETX (13) : 

Bank address input connections, BAO and BAl define which bank an 
ACTIVE, 

READ, WRITE, or BLOCK PROTECT command is being applied. The DQ0-DQ15 
connections 143 are data bus connections used for bi-directional data 
communication. Referring to FIG. IB, a VCCQ connection is used to 
provide 

isolated power to the DQ connections to improved noise immunity. In 
one 

embodiment, VCCQ=Vcc or 1.8V .+-. 0.15V. The VSSQ connection is used 
to 

isolated ground to DQs for improved noise immunity. The VCC 
connection 

provides a power supply, such as 3V. A ground connection is provided 
•through 

the Vss connection. Another optional voltage is provided on the VCCP 
connection 144. The VCCP connection can be tied externally to VCC, 
and sources 

current during device initialization, WRITE and ERASE operations. 
That is, 

writing or erasing to the memory device can be performed using a VCCP 
voltage, 

while all other operations can be performed with a VCC voltage. The 
Veep 

connection is coupled to a high voltage switch/pump circuit 145. 

Detailed Description Text - DETX (18) : 

In general, the synchronous flash memory is configured similar to 

a 

mult i -bank DRAM that operates at low voltage and includes a 
synchronous 

interface. Each of the banks is organized into rows and columns. 
Prior to 

normal operation, the synchronous flash memory is initialized . The 
following 

sections provide detailed information covering device initialization, 
register 

definition, command descriptions and device operation. 
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Detailed Description Text - DETX (19) : 

The synchronous flash is powered up and initialized in a 
predefined manner. 

After power is applied to VCC, VCCQ and VCCP (simultaneously) , and 
the clock 

signal is stable, RP# 14 0 is brought from a LOW state to a HIGH 
state. A 

delay, such as a 100 .mu.s delay, is needed after RP# transitions 
HIGH in order 

to complete internal device initialization. After the delay time has 
passed, 

the memory is placed in an array read mode and is ready for Mode 
Register 

programming or an executable command. After initial programming of a 
non-volatile mode register 147 (NVMode Register) , the contents are 
automatically loaded into a volatile Mode Register 148 during the 
initialization. The device will power up in a programmed state and 
will not 

require reloading of the non-volatile mode register 147 prior to 
issuing 

operational commands. This is explained in greater detail below. 

Detailed Description Text - DETX (30) : 

To allow for maximum power conservation, the synchronous flash 
features a 

very low current, deep power-down mode. To enter this mode, the RP# 
pin 140 

(reset/power-down) is taken to VSS.+-.0.2V. To prevent an 
inadvertent RESET, 

RP# must be held at Vss for 10 0 ns prior to the device entering the 
reset mode . 

With RP# held at Vss, the device will enter the deep power-down mode. 
After 

the device enters the deep power-down mode, a transition from LOW to 
HIGH on 

RP# will result in a device power-up initialize sequence as outlined 
herein. 

Transitioning RP# from LOW to HIGH after entering the reset mode but 
prior to 

entering deep power-down mode requires a 1 .mu.s delay prior to 
issuing an 

executable command. When the device enters the deep power-down mode, 
all 

buffers excluding the RP# buffer are disabled and the current draw is 
low, for 

example, a maximum of 50 .mu.A at 3.3V VCC . The input to RP# must 
remain at 

Vss during deep power-down. Entering the RESET mode clears the 
Status Register 

134 and sets the ISM 132 to the array read mode. 
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Detailed Description Text - DETX (36) : 

Upon power-up and prior to issuing any operational commands to the 
device, 

the synchronous flash is initialized. After power is applied to VCC, 
VCCQ and 

VCCP (simultaneously) , and the clock is stable, RP# is transitioned 
from LOW to 

HIGH. A delay (in one embodiment a 100 .mu.s delay) is required 
after RP# 

transitions HIGH in order to complete internal device initialization. 
The 

device is in the array read mode at the completion of device 
initialization, 

and an executable command can be issued to the device. 

Detailed Description Text - DETX (54) : 

In prior memories, a Vcc detector circuit could be used that would 
signal if 

the power applied to the device was high enough to read the register. 
Once the 

power was sufficient, then the circuitry could start to read these 
fuse 

elements arid load them into some volatile registers. These Vcc 
detector 

circuits are not reliable such that the level that they detect varies 
widely. 

In a typical case, these detectors could change from a detection 
level of 1.4v 

to 2 . 7v depending on process and temperature variations. Thus, 
reading of the 

fuses needed to be done at a worst-case situation, and the circuitry 
is very 

complicated and large because reading Flash cells with 1.4v is 
difficult . 

Detailed Description Text - DETX (56) : 

In one embodiment, the present invention requires the memory 
controller of 

the system to issue a device initialize command to the memory. The 
Flash then 

would start reading data from fuse elements. Since the Vcc is good 
at that 

time, the device uses the normal circuits on the chip to do the data 
reads and 

is smaller and more effective. In one embodiment, therefore, the 
present 

invention receives an initialize command that indicates that Vcc is 
at an 

appropriate level. The memory then reads the non-volatile "fuses". 
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Detailed Description Text - DETX (57) : 

In another embodiment, the Flash memory uses the reset connection 
(RP#) to 

perform an initialization process. The RP connection is used for 
multiple 

functions in the memory. One function is to allow access to 
protection 

circuitry in the memory. By providing an elevated voltage on the RP 
connection, the device protection modes can be ignored, and the 
memory protect 

register can be edited. The RP connection is also used to reset the 
device on 

power-up. Here the RP pin receives a voltage that transitions to a 
de-asserted 

(high) state when the memory is powered up. The memory then begins 
an 

initialization process, including reading the fuses. The memory 
includes a 

"filter" that prevents an initialization sequence from being 
performed if the 

RP connection is not low for a predetermined time. For example, in 
one 

embodiment the RP connection must be low (Vss) for at least 100 ns 
prior to 

entering a reset/initialization operation. This time requirement 
prevents 

accidental reset due to noise on the RP connection. 

Detailed Description Text - DETX (59) : 

The present invention, therefore, can also perform initialization 
operations 

based on the RP connection. Two different initialization schemes can 
be used 

with the present synchronous Flash memory . The first uses an 
initialize 

command from the memory controller of the system. The second uses 
the RP 

connection to begin initialization. This embodiment reduces the need 
for the 

system to provide a specific command, which may be difficult in some 
systems . 

Detailed Description Text - DETX (62) : 

As described above, the initialization operation can be started 
via hardware 

or software once Vcc has reached a proper value. The initialization 
operation 

requires an undetermined amount of time, and memory operations cannot 
be 
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performed until the initialization is done. One choice for the end 
user is to 

wait a specified period of time, such as 100 .mu.s, after power up 
and issuance 

of a proper command. This period of time is selected to be more time 
than 

required to perform the initialization operation. Thus, after this 
time 

period, the memory is initialized and ready. The initialization 
sequence, 

however, may not take that much time. In one embodiment, about 40 to 
45 . mu . s 

is needed to finish an Initialization sequence. 
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Detailed Description Text - DETX (29) : 

Turning now to FIG. 1, in accordance with one embodiment, software 
that uses 

the FLAT to allow multiple code and data images to be stored in FLASH 
memory , 

begins on power up or system reset with the BIOS executing and 
performing 

system initialization and Power on Self Test activities (block 110) . 
The 

contents of the FLASH memory may be validated by checking the CRC 
stored at 

field 96 in the FLASH memory, as indicated in block 112. At this 
point, the 

BIOS selects the boot loader (block 114) to execute by scanning ^the 
FLAT and 

selecting the entry marked as the boot loader. The boot loader then 
uses the 

FLAT to find where in the FLASH memory the primary operating system 
is located 

(block 116) , loads the operating system at the appropriate address in 
system 

memory (block 118) and starts its execution (block 120) . 

Current US Original Classification - CCOR (1) : 
713/1 

Current US Cross Reference Classification - CCXR 
(2) : 

713/2 
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Brief Summary Text - BSTX (5) : 

A flash EEPROM generally includes logic circuits, and after a 
setup of a 

power supply voltage/ the logic circuits should be initialized with a 
predetermined state. Thus, a power on reset circuit is used to 
generate a 

signal capable of initializing the logic circuits as soon as the 
power supply 
voltage is set up. 

Current US Original Classification - CCOR (1) : 
327/143 
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Detailed Description Text - DETX (14) : 

To speed up the operation of the present invention, a block RAM is 
included 

and used to store a table representing the LBAs, corresponding PBAs 
and 

relevant data. FIG. 7 shows a schematic representation of the table 
144 stored 

within the block RAM for the present invention. The block RAM table 
includes 

an entry 132 for each LBA in the system file partition plus some 
spare 

locations for increased performance. The entries for each LBA 0 
through N are 

initialized at power up to a corresponding cluster PBA 0 through N by 
reading 

the LBA field 108 in the flash extension. An entry for each PBA of 
the spare 

clusters is also included within the table. Each entry 132 includes 
a latest 

physical block address of the cluster (PBA) field 134 of the 
information 

associated with an LBA as well as a mirror of the old flag 138, the 
used flag 

140, and the defect flag 142 associated with the actual physical 
block address 

of the cluster in the semiconductor non-volatile mass storage memory. 
Also, 

the field 136 represents the most recent data or sector within the 
given 

physical block address associated with the given LBA. Therefore, if 
the block 

size (cluster size) is 8, and for example the most recent sector 
within this 

cluster is 5, the field 136 is loaded with the value 5 . This will 
help the 

controller during an access to locate the most recent sector within 



the 
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cluster. Alternatively, this field can also be removed if the 
controller reads 

every sector within the cluster to locate the most recent system 
file. 

Current US Original Classification - CCOR (1) : 
711/103 
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The advantages of using message passing over shared memory for certain types of 
communication and synchronization have provided an incentive to integrate both models 
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Message passing and shared memory are two techniques parallel programs use for 
coordination and communication. This paper studies the strengths and weaknesses of these 
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two mechanisms by comparing equivalent, well-written message-passing and shared- 
memory programs running on similar hardware. To ensure that our measurements are 
comparable, we produced two carefully tuned versions of each program and measured them 
on closely-related simulators of a message-passing and a shared-memory machine, ... 

The Rio file cache: sui^iving operating system crashes 

Peter M. Chen, Wee Teck Ng, Subhachandra Chandra, Christopher Aycock, Gurushankar 
Rajamani, David Lowell 

September 1996 Proceedings of the seventh international conference on Architectural 
support for programming languages and operating systems, volume 3i , 
30 Issue 9 , 5 

Full text available* fj(!lpdf(1.12 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

One of the fundannental limits to high-performance, high-reliability file systems is memory's 
vulnerability to system crashes. Because memory is viewed as unsafe, systems periodically 
write data back to disk. The extra disk traffic lowers performance, and the delay period 
before data is safe lowers reliability. The goal of the Rio (RAM I/O) file cache is to make 
ordinary main memory safe for persistent storage by enabling memory to survive operating 
system crashes. Reliable memory enables a syste ... 

Session 13: scheduling and operating systems: Application-specific protocols for user- 
level shared memory 

Babak Falsafi, Alvin R. Lebeck, Steven K. Reinhardt, loannis Schoinas, Mark D. Hill, James R. 
Larus, Anne Rogers, David A. Wood 

November 1994 Proceedings of the 1994 ACM/IEEE conference on Supercomputing 

Full text available: ^pdf(1.12 MB) Additional Information: full citation , abstract , references , citings 

Recent distributed stiared memory (DSN) systems and proposed shared-memory mactiines 
have implemented some or all of their cache coherence protocols in software. One way to 
exploit the flexibility of this software Is to tailor a coherence protocol to match an 
application's communication patterns and memory semantics. This paper presents evidence 
that this approach can lead to large performance Improvements, It shows that application- 
specific protocols substantially Improved the performance oft 



^ Integrating reliable memory in databases 
Wee Teck Ng, Peter M. Chen 

August 1998 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 7 Issue 3 

Full text available: ^pdfn23.18 KB) Additional Information: full citation , abstract , index terms 

Recent results in the Rio project at the University of Michigan show that it is possible to 
create an area of nnain memory that is as safe as disk from operating system crashes. This 
paper explores how to integrate the reliable memory provided by the Rio file cache into a 
database system. Prior studies have analyzed the performance benefits of reliable memory; 
we focus instead on how different designs affect reliability. We propose three designs for 
integrating reliable memory into databases: non ... 

Keywords: Main memory database system (MMDB), Recovery, Reliability 
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May 2001 ACM SIGPLAN Notices , Proceedings of the ACi^ SIGPLAN 2001 conference 

on Programming language design and implementation, volume 36 issue 5 
Full text available- i p3 pdf(1.60 MB) Additional Information: full citation , abstract , references , citings , index 
^ terms 



g e of 



Results (page 1): ^synchronous -fflash H-memory +initialize 



Page 3 of 5 



This paper presents the design and implementation of Event-driven State-machines 
Programming (ESP)— a language for programmable devices. In traditional languages, like C, 
using event-driven state-machine forces a tradeoff that requires giving up ease of 
development and reliability to achieve high performance. ESP is designed to provide all of 
these three properties simultaneously. 

ESP provides a comprehensive set of features to support development of compact and 
modular programs. ... 

s Efficient synchronization: let them eat QOLB 
Alain Kagi, Doug Burger, James R. Goodman 

May 1997 ACM SIGARCH Computer Architecture News , Proceedings of the 24th 

annual international symposium on Computer architecture, volume 25 issue 2 
Full text available' fi!| pdf(2.04 MB) Additional Information: full citation , abstract , references , citings , index 
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Efficient synchronization primitives are essential for achieving high performance in fine- 
grain, shared-memory parallel programs. One function of synchronization primitives is to 
enable exclusive access to shared data and critical sections of code. This paper makes three 
contributions. (1) We enumerate the five sources of overhead that locking synchronization 
primitives can incur. (2) We describe four mechanisms (local spinning, queue-based locking, 
collocation, and synchronized prefetch) that r ... 

^ Building reliable mobile-aware applications using the Rover toolkit 
Anthony D. Joseph, M. Frans Kaashoek 
October 1997 Wireless Networks, Volume 3 issue 5 

Full text available' df(371 04 KB) Additional Information: full citation , abstract , references , citings , index 
'^^■^^ '■ terms 

This paper discusses extensions to the Rover toolkit for constructing reliable mobile-aware 
applications. The extensions improve upon the existing failure model, which addresses client 
or communication failures and guarantees reliable message delivery from clients to server, 
but does not address server failures (e.g., the loss of an incoming message due to server 
failure) (Joseph et al., 1997). Due to the unpredictable, intermittent communication 
connectivity typically found In mobile clien ... 

^0 Performance experiences on Sun's Wildfire prototype 
Lisa Noordergraaf, Ruud van der Pas 

January 1999 Proceedings of the 1999 ACM/IEEE conference on Supercomputing 
(CDROM) 
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11 Location-awareness and interworking: Proximity services supporting network virtual 
memory in mobile devices 

Emanuele Lattanzi, Andrea Acquaviva, Alessandro Bogliolo 

October 2004 Proceedings of the 2nd ACM international workshop on Wireless mobile 
applications and services on WLAN hotspots 

Full text available: ^ pdf(1 92.89 KB) Additional Information: full citation , abstract , references , index terms 

Wireless networked embedded terminals like personal digital assistants, cell-phones or 
sensor nodes are typically memory constrained devices. Tliis limitation prevents the 
development of applications that require a large amount of run-time memory space. In a 
wired cum wireless scenario, a potentially unlimited amount of virtual memory can be found 
on remote servers installed on the wired network. However, virtual memory access requires 
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performance constrained and lossless data flows against te ... 

Keywords: mobility management, network swapping, proximity service, wireless networks 
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Robustness: Using model checking to debug device firmware | 
Sanjeev Kumar, Kai Li 

December 2002 ACM SIGOPS Operating Systems Review, Volume 36 issue si 
Full text available: ^ pdf(1.72 MB) Additional Information: full citation , abstract , references 

Device firmware is a piece of concurrent software that achieves high performance at the cost 
of software complexity. They contain subtle race conditions that make them difficult to 
debug using traditional debugging techniques. The problem is further compounded by the 
lack of debugging support on the devices. This is a serious problem because the device 
firmware is trusted by the operating system. Model checkers are designed to systematically 
verify properties of concurrent systems. Therefore, mod ... 

15 A distributed 3D graphics library | 
Blair Maclntyre, Steven Feiner 

July 1998 Proceedings of the 25th annual conference on Computer graphics and 
interactive techniques 

Full text available: ^ pdf(355.83 KB) Additional Information: full citation , references , citings , index terms 
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December 1995 ACM SIGOPS Operating Systems Review , Proceedings of the fifteenth 
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^9 Static analysis to reduce synchronization costs in data-parallel programs 
Manisli Gupta, Edith Schonberg 

January 1996 Proceedings of the 23rd ACM SIGPLAN-SIGACT symposium on Principles 
of programming languages 
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20 The nesC language: A holistic approach to networked embedded systems | 
David Gay, Philip Levis, Robert von Behren, Matt Welsh, Eric Brewer, David Culler 
May 2003 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2003 conference 

on Programming language design and implementation, Volume 38 issue 5 
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We present nesC, a progrannming language for networked embedded systenns that represent 
a new design space for application developers. An exannple of a networked embedded 
system is a sensor network, which consists of (potentially) thousands of tiny, low-power 
"motes," each of which execute concurrent, reactive programs that must operate with 
severe memory and power constraints. nesC's contribution is to support the special needs of 
this domain by exposing a programming model that incorporates ... 

Keywords: C, TinyOS, components, concurrency, data races, first-order, modules, nesC, 
programming languages 
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^ Verification: Model based estimation and verification of mobile device performance 
Gopal Raghavan, Ari Salomaki, Raimondas Lencevicius 

September 2004 Proceedings of the fourth ACM international conference on Embedded 
software 

Full text available: ^ pdf(252.67 KB) Additional Information: full citation , abstract , references , index terms 

Performance is an important quality attribute that needs to be planned and managed 
proactively. Abstract models of the system are not very useful if they do not produce 
reasonably accurate metrics. Detailed models are time consuming and expensive to build as 
well as to simulate. In order to strike a right balance, a framework is proposed in this paper 
that takes advantage of the flexibility of abstract modeling and intricacies of detailed 
modeling. Performance is modeled and verified per use cas ... 

Keywords: performance analysis, system level modeling, use case verification 



2 Building reliable mobile-aware applications using the Rover toolkit 
Anthony D. Joseph, M. Frans Kaashoek 
October 1997 Wireless Networks, volume 3 issue 5 
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This paper discusses extensions to the Rover toolkit for constructing reliable mobile-aware 
applications. The extensions improve upon the existing failure model, which addresses client 
or communication failures and guarantees reliable message delivery from clients to server, 
but does not address server failures (e.g., the loss of an incoming message due to server 
failure) (Joseph et al., 1997). Due to the unpredictable, intermittent communication 
connectivity typically found in mobile clien ... 

^ Building reliable mobile-aware applications using the Rover toolkit 
Anthony D. Joseph, M. Frans Kaashoek 

November 1996 Proceedings of the 2nd annual international conference on Mobile 
computing and networking 
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April 1990 ACM SIGPLAN Notices, Volume 25 issue 4 

Full text available: ^ pdf(790.56 KB) Additional Information: full citation , abstract , references 

Interactors is a run-tinne environment for embedded real-time software, which adds 
concurrency to the C++ object-oriented language. Interactors allows sequential processes to 
interact synchronously or asynchronously, and provides user-definable multiparty 
interactions. Several forms of selective wait, inspired by Ada, are provided. Scheduling 
algorithms follow Carnegie-Mellon University's recommendations for implementing hard 
deadline scheduling. Concepts are illustrated by simple a ... 

5 Partial reconfigurable architectures: Real-time LUT-based network topologies for Q 
dynamic and partial FPGA self-reconfiguration 
Michael Huebner, Tobias Becker, Juergen Becker 

September 2004 Proceedings of the 17th symposium on Integrated circuits and system 
design 

Full text available:* ^ pdf(356.70 KB) Additional Information: full citation , abstract , references , index terms 

Xilinx Virtex FPGAs offer the possibility of dynamic and partial run-time reconfiguration. If a 
systenn uses this feature the designer has to take care, that no signal lines cross the border 
to other reconfigurable regions. Traditional solutions connecting modules on a dynamic and 
partial reconfigurable system use TBUF elements for connection and separation of the 
functional blocks. While automalically placing and routing the design, the routing-tool 
sonnetimes uses signal lines which cross the mo ... 

Keywords: dynamic partial reconfiguration, virtex 
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7 Special issue on wireless pan & sensor networks: A study of energy consumption and 
reliability in a multi-hop sensor network 
Jonathan M. Reason, Jan M. Rabaey 

January 2004 ACM SIGMOBILE Mobile Computing and Communications Review, Volumes 
Issue 1 

Full text available: ^ pdf(477.91 KB) Additional Information: full citation , abstract , references 

For a moderate-size, multi-hop, sensor network, we present experimental measurements of 
radio energy consumption and packet reliability. We categorize the energy measurements by 
energy consumed in each radio state and for each traffic type. Packet reliability results are 
presented from a network and link perspective, whereas prior work only considered the 
former. We introduce a novel technique of application-aware radio duty cycling called on- 
demand spatial TDMA. When compared to the non-cycling ... 

Q Papers: Wireless data communications using DECT air interface 
Antonio Muchaxo, Alexandre Sousa, Nuno Pereira, Helena Sarmento 
April 1999 ACM SIGCOMM Computer Communication Review, Volume 29 issue 2 

Full text available: ^ pdf(1.25 MB) Additional Information: full citation , abstract , references , citings 



DECT Is an approved ETSI standard for cordless communications, defined as a general radio 
access technology that can be used as the air interface to any network. In addition to the 
well-established voice service, it supports data communications. DECT currently addresses 
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low bit rates, but additional modulation options have recently been Included for high-speed, 
up to 2Mbps. In this paper, we describe the hardware and software design of an entire 
wireless communications system to be used In SOHO ... 
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Soft updates: a solution to the metadata update problem in file systenns 
Gregory R. Ganger, Marshall Kirk McKusick, Craig A. N. Soules, Yale N. Patt 
May 2000 ACM Transactions on Computer Systems (TOCS), Volume 18 issue 2 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: fl pdfd 47.90 KB) 



Metadata updates, such as file creation and block allocation, have consistently been 
identified as a source of performance, integrity, security, and availability problems for file 
systems. Soft updates is an implementation technique for low-cost sequencing of fine- 
grained updates to write-back cache blocks. Using soft updates to track and enforce 
metadata update dependencies, a file system can safely use delayed writes for almost all file 
operations. This article describes soft ... 

2^ Increasing cache port efficiency for dynamic superscalar microprocessors 
Kenneth M. Wilson, Kunle Olukotun, Mendel Rosenblum 

May 1996 ACM SIGARCH Computer Architecture News , Proceedings of the 23rd 

annual international symposium on Computer architecture, volume 24 issue 2 

Additional Information: full citation , abstract , references , citings , index 
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The memory bandwidth demands of modern microprocessors require the use of a multi- 
ported cache to achieve peak performance. However, multi-ported caches are costly to 
implement. In this paper we propose techniques for improving the bandwidth of a single 
cache port by using additional buffering in the processor, and by taking maximum 
advantage of a wider cache port. We evaluate these techniques using realistic applications 
that include the operating system. Our techniques using a single-ported ca ... 

23 Firmware factory & forth 
Brad Eckert 

December 1999 ACM SIGPLAN Notices, Volume 34 issue 12 

Full text available: ^ pdf(373.72 KB) Additional Information: full citation , citings , index terms 
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Daehyun Kim, Mainak Chaudhurl, Mark Heinrich 
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Active mennor7 systems help processors overcome the memory wall when applications 
exhibit poor cache behavior. They consist of either active memory elements that perform 
data parallel computations in the memory system itself, or an active memory controller that 
supports address re-mapping techniques that improve data locality. Both active memory 
approaches create coherence problems— even on uniprocessor systems— since there are 
either additional processors operating on the data directly, or the ... 

Keywords: active memory, address re-mapping, cache coherence 
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July 1998 Proceedings of the 12th international conference on Supercomputing 
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26 Location-awareness and interworking: Proximity services supporting network virtual 
memory in mobile devices 

Emanuele Lattanzi, Andrea Acquaviva, Alessandro Bogliolo 

October 2004 Proceedings of the 2nd ACM international workshop on Wireless mobile 
applications and services on WLAN hotspots 

Full text available: ^ pdfd 92.89 KB) Additional Information: full citation , abstract , references , index terms 

Wireless networked embedded terminals like personal digital assistants, cell-phones or 
sensor nodes are typically memory constrained devices. This limitation prevents the 
development of applications that require a large amount of run-time memory space. In a 
wired cum wireless scenario, a potentially unlimited amount of virtual memory can be found 
on remote servers installed on the wired network. However, virtual memory access requires 
performance constrained and lossless data flows against te ... 

Keywords: mobility management, network swapping, proximity service, wireless networks 



27 Cellular Disco: resource management using virtual clusters on shared-memory 
multiprocessors 

Kinshuk Govil, Dan Teodosiu, Yongqiang Huang, Mendel Rosenblum 

December 1999 ACM SIGOPS Operating Systems Review , Proceedings of the 

seventeenth ACM symposium on Operating systems principles, Volume 33 

Issue 5 

Full text available: I sS pdfn.93 MB) Additional Information: full citation , abstract, references , citings, index 
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Despite the fact that large-scale shared-memory multiprocessors have been commercially 
available for several years, system software that fully utilizes all their features is still not 
available, mostly due to the complexity and cost of making the required changes to the 
operating system. A recently proposed approach, called Disco, substantially reduces this 
development cost by using a virtual machine monitor that leverages the existing operating 
system technology. In this paper we present a syste ... 

28 Where is time spent in message-passing and shared-memory programs? 
Satish Chandra, James R. Larus, Anne Rogers 

November 1994 Proceedings of the sixth international conference on Architectural 

support for programming languages and operating systems, Volume 29 , 
28 Issue 11 , 5 
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Message passing and shared mennory are two techniques parallel programs use for 
coordination and communication. This paper studies the strengths and weaknesses of these 
two mechanisms by comparing equivalent, well-written message-passing and shared- 
memory programs running on similar hardware. To ensure that our measurements are 
comparable, we produced two carefully tuned versions of each program and measured them 
on closely-related simulators of a message-passing and a shared-memory machine, ... 

29 Cellular disco: resource management using virtual clusters on shared-mennory 
multiprocessors 

Kinshuk Govil, Dan Teodoslu, Yongqiang Huang, Mendel Rosenblum 

August 2000 ACM Transactions on Computer Systems (TOCS), Volume 18 issue 3 

Additional Information: full citation , abstract , references , citings , index 
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Despite the fact that large-scale shared-memory multiprocessors have been commercially 
available for several years, system software that fully utilizes all their features is still not 
available, mostly due to the complexity and cost of making the required changes to the 
operating system. A recently proposed approach, called Disco, substantially reduces this 
development cost by using a virtual machine monitor that laverages the existing operating 
system technology. In this paper we present a ... 

Keywords: fault containment, resource managment, scalable multiprocessors, virtual 
machines 



30 Integrating perfornnance monitoring and communication in parallel computers | 
Margaret Martonosi, David Ofelt, Mark Helnrich 

May 1996 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 1996 
ACM SIGMETRICS international conference on Measurement and modeling 
of computer systems, volume 24 issue i 

Full text available* df{T 49 MB) Additional Information: full citation , abstract , references , citings , index 
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A large and increasing gap exists between processor and memory speeds in scalable cache- 
coherent multiprocessors. To cope with this situation, programmers and compiler writers 
must increasingly be aware of the memory hierarchy as they implement software. Tools to 
support memory performance tuning have, however, been hobbled by the fact that it is 
difficult to observe the caching behavior of a running program. Little hardware support 
exists specifically for observing caching behavior; furthermore ... 

^'^ Impala: a middleware system for managing autononnic, parallel sensor systems | 
Ting Liu, Margaret Martonosi 

June 2003 ACM SIGPLAN Notices , Proceedings of the ninth ACi^ SIGPLAN symposium 

on Principles and practice of parallel programming, volume 38 issue lo 
Full text available' ^ pdf(684 33 KB) Additional Information: full citation , abstract , references , citings , index 
^ terms 

Sensor networlcs are long-running computer systems with many sensing/compute nodes 
working to gather information about their environment, process and fuse that Information, 
and In some cases, actuate control mechanisms in response. Lilce traditional parallel 
systems, communication between nodes is of fundamental Importance, but is typically 
accomplished via wireless transceivers. One further key attribute of sensor networks is that 
they are almost always long running systems. Intended to operate i ... 

Keywords: middleware system, sensor networks, software adaptation, software update 
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32 Disco: running commodity operating systems on scalable multiprocessors | 
Edouard Bugnion, Scott Devine, Kinshuk Govil, Mendel Rosenblum 
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In this article we examine the problem of extending modern operating systems to run 
efficiently on large-scale shared-memory multiprocessors without a large implementation 
effort. Our approach brings back an idea popular in the 1970s: virtual machine monitors. We 
use virtual machines to run multiple commodity operating systems on a scalable 
multiprocessor. This solution addresses many of the challenges facing the system software 
for these machines. We demonstrate our approach with a prototy ... 

Keywords: scalable multiprocessors, virtual machines 



33 Integrating reliable memon/ in databases 
Wee Teck Ng, Peter M. Chen 

August 1998 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 7 Issue 3 

Full text available: ^ pdf(123.18 KB) Additional Information: fuli citation , abstract , index terms 

Recent results in the Rio project at the University of Michigan show that it is possible to 
create an area of main memory that is as safe as disk from operating system crashes. This 
paper explores how to integrate the reliable memory provided by the Rio file cache into a 
database system. Prior studies have analyzed the performance benefits of reliable memory; 
we focus instead on how different designs affect reliability. We propose three designs for 
integrating reliable memory into databases: non ... 

Keywords: Main memory database system (MMDS), Recovery, Reliability 
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i-iii ^ I ui 0t ^^/^ 07 n^D\ Additional Information: full citation , abstract , references , citings , index 
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This paper introduces dynamic self-invalidation (DSI), a new technique for reducing cache 
coherence overhead in shared-memory multiprocessors. DSI eliminates invalidation 
messages by having a processor automatically invalidate its local copy of a cache block 
before a conflicting access by another processor. Eliminating invalidation overhead is 
particularly important under sequential consistency, where the latency of invalidating 
outstanding copies can increase a program's critical path. DSI is ap ... 

Hardware/Software Co-testing of Embedded Memories in Complex SOCs 
Bai Hong Fang, Qiang Xu, Nicola Nicolici 

November 2003 Proceedings of the 2003 IEEE/ACM international conference on 
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and dedicated design for test (DFT) hardwaresuch that the functional power constraints are 
not exceededduring test while trading-off the testing time agalnstDFT area and performance 
overhead. The suitability ofsoftware-centric and hardware-centric approaches for 
embeddedmemory testing is examined and to combine the advanta 
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Future parallel computers must efficiently execute not only hand-coded applications but also 
programs written in high-level, parallel programming languages. Today's machines limit 
these programs to a single communication paradigm, either message-passing or shared- 
memory, which results in uneven performance. This paper addresses this problem by 
defining an interface, Tempest, that exposes low-level communication and memory-system 
mechanisms so programmers and compilers can customize polici ... 
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The performance portability of parallel progranns across a wide range of ennerging coinerent 
shared address space systems is not well understood. Programs that run well on efficient, 
hardware cache-coherent systems often do not perform well on less optimal or more 
commodity-based communication architectures. This paper studies this issue of performance 
portability, with the commodity communication architecture of interest being page-grained 
shared virtual memory. We begin with applications that per ... 
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application can improve the performance of software shared-memory programs 
substantially. Existing systems for using customizable protocols are hard to use directly 
because the mechanisms they provide for manipulating protocols are low-level ones. This 
article is an in-depth study of the issues involved in providing language support for 
application-specific protocols. We describe the design and implementat ... 

Keywords: parallel processing 
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